Multi-lingual duration modeling
نویسندگان
چکیده
Controlling timing in text-to-speech synthesis systems is complicated, because there are many contextual factors that affect timing; moreover, which factors matter and what their precise effects are varies among languages. We describe here a language-independent approach for duration control. At run time, a language-independent timing module accesses languagespecific tables. These tables specify which sub-classes of the feature space (i.e., all combinations of context and phone identity) are homogeneous in the specific sense that the same factors have similar effects on the cases in a sub-class. Within a sub-class, durations are modeled by simple arithmetic models such as multiplicative, additive, or – more generally – sums-ofproducts models. Exploratory statistical methods (supervised) and parameter estimation techniques (unsupervised) are used for
منابع مشابه
Lingual orthodontic treatment duration: performance of two different completely customized multi-bracket appliances (Incognito and WIN) in groups with different treatment complexities
INTRODUCTION The occurrence of side-effects of fixed orthodontic therapy, such as white-spot lesions and root resorption, are known to be significantly more frequent with increasing duration of treatment. Multi-bracket treatment should be as short as possible, in order to minimize the risks of collateral damage to teeth. The aim of this non-randomized clinical trial was to compare treatment dur...
متن کاملN-Gram Language Modeling for Robust Multi-Lingual Document Classification
Statistical n-gram language modeling is used in many domains like speech recognition, language identification, machine translation, character recognition and topic classification. Most language modeling approaches work on n-grams of terms. This paper reports about ongoing research in the MEMPHIS project which employs models based on character-level n-grams instead of term n-grams. The models ar...
متن کاملMulti-lingual and Multi-modal Speech Processing and Applications
Over the last decade voice technologies for telephony and embedded solutions became much more mature, resulting in applications providing mobile access to digital information from anywhere. Both a growing demand for voice driven applications in many languages and the need for improved usability and user experience now drives the exploration of multi-lingual speech processing techniques for reco...
متن کاملExperiments in Cross Language Query Focused Multi-Document Summarization
The twin challenges of massive information overload via the web and ubiquitous computers present us with an unavoidable task: developing techniques to handle multilingual information robustly and efficiently, with as high quality performance as possible. Previous research activities on multilingual information access systems have studied cross-language information retrieval (CLIR), information ...
متن کاملFST-based recognition techniques for multi-lingual and multi-domain spontaneous speech
In this paper we present techniques for building multi-domain and multi-lingual recognizers within a finite-state transducer (FST) framework. The flexibility of the FST approach is also demonstrated on the task of incorporating networks modeling different types of non-speech events into an existing word lattice network. The ability to create robust multi-domain and/or multi-lingual recognizers ...
متن کاملOnset of action and duration of efficacy of inferior alveolar nerve block versus single lingual subperi-osteal injection of 4% articaine in mandibular second molars: A randomized clinical trial
Background and Aim: Achieving adequate pulpal anesthesia could be challenging in mandibular molars. There are some disagreements about the success rate of local infiltration anesthesia with articaine as primary injection. Therefore, the aim of this study was to assess the efficacy of 4% articaine lingual subperiosteal injection as the primary injection for permanent mandibular second molars in ...
متن کامل